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Abstract 

This paper studies dynamic stochastic optimization problems parametrized 
by a random variable. Such problems arise in many applications in op- 
erations research and mathematical finance. We give sufficient conditions 
for the existence of solutions and the absence of a duality gap. Our proof 
uses extended dynamic programming equations, whose validity is estab- 
lished under new relaxed conditions that generalize certain no-arbitrage 
conditions from mathematical finance. 



1 Introduction 

Let (n,J-,P) be a probability space with a filtration (J 7 t)f =0 (an increasing 
sequence of sub-sigma-algebras of JF) and consider the dynamic stochastic op- 
timization problem 

minimize Ef{x{ui),u{u)),u)) over x S J\f, (P) 

where, for given integers nt and m 

N = {(xt)J =0 \x t G L°(Q, J- t ,P;R" f )}, 

u 6 L°(Q,J-, P; M m ) and / is an extended real- valued convex normal integrand 
on R n x R m x il, where n = n + ... + n T . Recall that L°(Q, T t , P; K nt ) 
denotes the space of equivalence classes of J^-measurable M™ 4 -valued functions 
that coincide P-almost surely. That / is a normal integrand, means that the 
set-valued mapping uj i— y {(x, u, a) | f(x, u, uj) < a} is closed- valued and T- 
measurable; see e.g. [23l Chapter 14]. This implies that / is T ® B(M. n x 
R m )-measurable (see |23[ Corollary 14.34]), so that uj h-> f(x(ui),u(uj),uj) is T- 
measurable for every x S M . Throughout this paper, the expectation is defined 
for any measurable function by setting it equal to +00 unless the positive part 
is integrable. We will also assume that T as well as T t for t = 0, . . . , T are 
complete with respect to i^. 



1 This allows us to use certain results on conditional expectations of integrands which are 
not necessarily normal in the general case. This is based on 23 Corollary 14.34], which says 
that, when T is P-complete, then a function f : Q X R n X R m -> 1 is a normal integrand if 
and only if it is J-"<g> B(R n X R m )-measurable and (x,u) i-> f(ui, x,u) is lower semicontinuous 
for every uj. 
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The measurable function u may be interpreted as a parameter or a pertur- 
bation in a given stochastic optimization problem. It was shown in |17j that 
(|Pj) covers many important problems in operations research and mathematical 
finance and how the conjugate duality framework of Rockafellar [21] allows for 
a unified treatment of many well-known duality frameworks. In that context, 
the lower-semicontinuity of the value function 

ip(u)= inf Ef(x(w),u(w),w) 
xeN 

over an appropriate space of measurable functions u is equivalent to the absence 
of a duality gap; see [TT1 Section 2] for a precise statement. In certain appli- 
cations, most notably in mathematical finance, the objective in ([P]) lacks the 
inf-compactness properties required by the classical "direct method" of calculus 
of variations for establishing lower semicontinuity (and the existence of solu- 
tions) . It was shown in [TTj Section 5] how certain measure theoretic techniques 
from mathematical finance can be combined with classical techniques of convex 
analysis to obtain the lower semicontinuity of ip. It is essential for this that the 
strategies x € Af are allowed to be general measurable functions not restricted 
to be e.g. integrable. The lower semicontinuity result given in |17j . however, 
applies to normal integrands / that take only the values and +oo. While that 
already covers some fundamental results in mathematical finance, as illustrated 
in |17i Section 6], it is far from satisfactory from the general point of view. 

The main purpose of this paper is to establish the lower semicontinuity of 
ip for more general normal integrands. This will be done in Section [3] Our 
proof extends that of [171 Theorem 8], which employs a recursive argument 
reminiscent of dynamic programming. We clarify this connection in Section [2] 
by generalizing the dynamic programming equations proposed by Rockafellar 
and Wets |22j for stochastic convex optimization. The dynamic programming 
equations were substantially generalized already by Evstigneev [5] who removed 
many of the assumptions made in |22j . including convexity. We will show that 
in the convex case, the inf-compactness assumption made in both |22j and [5| 
can be replaced by weaker "recession condition" which subsumes, in particular, 
various no-arbitrage conditions used in mathematical finance. An early appli- 
cation of recession analysis to utility maximization in financial markets can be 
found in Bertsekas pQ. Section|4]of this paper gives an application to an optimal 
consumption problem in illiquid markets. 

2 Dynamic programming 

The purpose of this section is to extend the dynamic programming recursion 
of [221 Section 3] which generalizes the classical Bellman equation for convex 
stochastic optimization. We will use the notion of a conditional expectation of 
a normal integrand much as in [9] where certain assumptions (convexity, nonan- 
ticipativity of the domain of / and the boundedness of the strategies) of [22] 
were relaxed. We show that, in the convex case, the inf-compactness assumption 
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used in both [25] and [5] can be replaced by a milder condition on the directions 
of recession much like in the classical closedness results of finite-dimensional 
convex analysis; see [501 Section 8]. In certain financial applications, the new 
condition turns out to be equivalent to the classical no-arbitrage condition. 

Let X be a nonnegative J^-measurable function and let Q C T be an- 
other sigma-algebra. Then, there is a (^-measurable nonnegative function E g X, 
unique up to sets of P-measure zero, such that 

E[ XA X]=E[xa{E g X)] (1) 

where \A denotes the characteristic function of A; see e.g. Shiryaev [241 II. 7]. 
The function E g X is called the Q- conditional expectation of X. For a general 
J 7 - measurable extended real- valued function X, we set 

E g X := E G X+- E g X^, 

where again, the convention oo — oo = oo is used. It is easily checked that with 
the extended definition of the integral, ([T]) is then valid for any measurable func- 
tion X. Our definition of conditional expectation extends [24j Definition II. 7.1], 
which assumes that min {E G X+,E g X-} < oo almost surely. Our choice of set- 
ting oo — oo = oo is not arbitrary but specifically directed towards minimization 
problems. 

The Q- conditional expectation of a normal integrand h is a C/-measurable 
normal integrand E g h such that 

(E g h)(x(cj),u) = E g [h(x(-),-)](u) P-a.s. 

for all x e L°(Tt, Q, P;R n ). There are various conditions that guarantee the 
existence and uniqueness of a conditional expectation of a normal integrand; 
see e.g. Bismut [4], Dynkin and Evstigneev [8], Castaing and Valadier [5j Sec- 
tion VIII.9], Thibault [25], Truffert [H] or Choirat, Hess and Seri [BJ. The 
following suffices for the purposes of this paper. 

Lemma 1. Let Q C T be a sigma-algebra and assume that h is an J- -normal 
integrand with an integrable lower bound i.e. an integrable function m such that 
h(x,uj) > m(uj) for every x and uj. Then h has a well-defined conditional 
expectation E g h which has the integrable lower bound E G m. 

Proof. The integrable lower bound implies, for example, the quasi-integrability 
condition of Thibault [3S] as well as the condition of Choirat, Hess and Seri [B], 
both of which give the existence and uniqueness of the conditional expectation. 
It follows from the monotonicity of the conditional expectation that if / > m 
for an integrable function m, then E g f > E g m. □ 

We will study problem fP]) for a fixed u £ L°(uj, T, P; R m ) so we will omit 
it from the notation and define 

h(x,w) = f(x,u(uj),uj). 
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By [531 14.45(c)], ft. is a normal integrand. The convexity of / implies that of h. 
We will use the notation E t — E^ 1 and x l — (xq, ■ ■ ■ ,Xt) an d define extended 
real- valued functions h t ,h t : R™ 1+ '"+' lt x ft — > R recursively for t = T, . . . , by 

fix = h, 

h t =E t ht, (2) 
ht-iix^ 1 ,^) = inf /i t (x t_1 ,a; t ,cj). 

This is essentially the dynamic programming recursion introduced in [2 2) . Our 
formulation with conditional expectations of normal integrands is closer to [9] , 
where certain assumptions of |22j were relaxed. In the above formulation, one 
does not separate the decision variables Xt into "state" and "control" like in the 
classical dynamic programming models; see e.g. [3J and [2]. A formulation closer 
to the classical dynamic programming equations will be given in Corollary @] 
below. A recent application of dynamic programming to mathematical finance 
can be found in Rasonyi and Stettner [19l Section 5]. 

In order to ensure that h t and h t are well-defined it suffices to require that 
the function h has an integrable lower bound and that h(-,uj) is inf-compact 
(i.e. {x £ R™ | h(x, w) < a} is compact for every a £ R) for every ui £ Q: sec [9, 
Theorem 5] . In the convex case, the compactness assumption can be replaced by 
a weaker condition stated in terms of the recession function of h. If dom/i(-,a;) 
is nonempty, then the recession function has the expression 

h(Xx + x, u) - h(x, u) 

h [x, uj) — sup , 

a>o A 

which is independent of the choice of x £ domft(-,cj); see [3D1 Theorem 8.5] 
or 3.21]. By [531 Exercise 14.54(a)], the function h°° is a convex normal 
integrand. If h(-, ui) has an integrable lower bound, then h°°(x, uj) > for every 
x £ R n as is easily seen by letting A — >• oo. 

Lemma 2. Assume that h t is a normal integrand and that the set-valued map- 
ping 

N t (u) = {x t £ R™ 4 | h^(x\uj) < 0, x 1 - 1 = 0} 
is linear-valued. Then h t ~\ is a normal integrand with 

h^Z 1 (x t - 1 ,u>)= inf hTix^iX^u). 

Moreover, given anx £ Af, there is an J- \ -measurable x t such that Xt(uj) _L N t (uj) 
and 

ft, t „i(a; t_1 (iu),iu) = /i t (x* _1 (a;),xt(tj),tj). 

Proof. By [20l Theorem 9.2], the linearity condition implies that the infimum in 
the definition of ht-\ is attained and that h t -i(-,Lu) is a lower semicontinuous 
convex function with 

h?L 1 (x t - x ,u)= inf hTix^^t,^). 

i t 6R"t 
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By [331 Proposition 14.47], the lower semicontinuity implies that h t -\ is an Tt- 
measurable convex normal integrand. By [23j Proposition 14.45(c)], the function 
p(x,u>) := /it(x* _1 (a;),x,a;) is then also an J^-measurable normal integrand so, 
by [23, Theorem 14.37], there is an Tt -measurable x t that attains the minimum 
for every v. By [501 Corollary 8.6.1], the value of /i t (x* _1 (a;),x,a;) does not 
change if we replace Xt(u) by its projection to the orthogonal complement of 
N t (uj). By 23, Exercise 14.17], such a projection preserves measurability. □ 

It is clear that if ht has an integrable lower bound, then so will h%—\. Ap- 
plying Lemmas [T] and [2] recursively backwards for t = T, . . . , 0, we then see that 
if h has an integrable lower bound, the functions ht and ht are well-defined for 
every t provided that Nt is linear-valued at each step. 

We now get the following refinement of the optimality conditions in [351 
Theorem 1] and [9j Theorems 1 and 2] in the convex case. 

Theorem 3. Assume that h has an integrable lower bound and that N t is linear- 
valued for t = T, . . . , 0. The functions h t are then well-defined normal integrands 
and we have for every x G M that 

Eht(xt{u),u>)>in£QH t = 0,...,T. (3) 

Optimal solutions x G J\f exist and they are characterized by the condition 

x t {uj) G argmin h t (x t ~ 1 (ui),x t7 uj) P-a.s. t = 0,...,T. 

which is equivalent to having equalities in (|3|). Moreover, there is an optimal 
solution x G J\f such that xt -L Nt for every t = 0, . . . , T. 

Proof. As noted above, a recursive application of Lemmas [T] and [2] imply that 
the functions h t and ht are well-defined normal integrands. Given an x G A/", 
the law of iterated expectations (see e.g. Shiryaev [2H Section II. 7]) gives 

Eht(x*(u), u) > Eht-iix*- 1 ^), u) = Eht-ti^' 1 ^), w) t= 1, . . . , T. 
Thus, 

Eh(x(uj),uj) — Ehx(x T (lu),u}) > Eh$(x° (uj),uj) > E inf /io(xo,w), 

i 6t"o 

where the inequalities hold as equalities if and only if 

htix^uj),^) = h t - 1 {x t ~ 1 (uj),uj) P-a.s. t = 0,...,T. 

The existence of such an x G Af with xt -L Nt follows by applying Lemma [2] 
recursively for t = 0, . . . , T. □ 

When the normal integrand h has a separable structure, the dynamic pro- 
gramming equations (J5J can be written in a more familiar form. 
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Corollary 4 (Bellman equations). Assume that 

T 

h(x,uj) = ^ h(xt-i,x t ,u) 

t=0 

for some fixed initial state X-\ and J- t -measurable normal integrands h t with 
integrable lower bounds. Consider the functions Vt : R™' x Q — > R given by 

V t (x t ,uj) = 0, 

V t -\{x t -i,u) = inf {k t {x t _i,x t ,u) + V t (x t ,u)}, (4) 

Vt-i = E t _xV t -x 

and assume that the set-valued mappings 

N t (uj) = {x t e R nt \k^°(fl,xt,u) + V t °°(x t ,Lu) < 0} 

are linear-valued for each t = T, ...,0. The functions V t are then well-defined 
normal integrands and we have for every x S Af that 



E 



} J k s {x s -i[w),x s {u),u) + Vt(x t (ui),uj) 



s=0 



> inf © t = 0, . . . , T. (5) 



Optimal solutions x € J\f exist and they are characterized by the condition 
x t {u) E argmin{fc t (x t _i(w),a; t ,cj) + V t (x t ,u))} P-a.s. t = 0,...,T, 

which is equivalent to having equalities in ([5]). Moreover, there is an optimal 
solution x 6 M such that x t _L N t for every t — 0, . . . , T. 

Proof. By Theorem [3J it suffices to show that 



h t {x t i ui) = 2J k s (x s -i,x s ,u>) + V t {x t ,oj) 



(6) 



for every t = 0, . . . ,T. For t = T, ([BJ is obvious since Vt = by definition. 
Assuming that © holds for t, we get 

ht-iix 1 ' 1 ,u}) = inf htix*' 1 ,x t ,uj) 



y^k s (x 8 - 1 ,x a ,u})+ inf {k t (x t -i,xt,u)) + Vt(x t ,u))} 



s=0 
t-1 



y^fc s (a: s -i,x s ,a;) + ^-1(^-1,0;) 

s=0 



G 



and then, since for s = 0, . . . ,t — 1, k s is J-~ t _i-measurable, 

t-i 

/i t _i(a; t_1 ,a;) = ^ k s {x s -i,x s ,u) + Vt-i(x t -i,u), 

s=0 

where, by Lemma [TJ Vt is a well-defined normal integrand when N t is linear- 
valued. □ 

The rest of this section is devoted to the study of the linearity condition 
in Theorem [3] Recall that a Q -measurable selector of an R"-valued set- valued 
mapping C is a (y-measurable function x such that x(tu) s C'(w) almost surely. 

Lemma 5. Let Q Q J- be a sigma- algebra and assume that h is an T-normal 
integrand with an integrable lower bound. If there is an x € L°(fl, Q, P; R") 
such that Eh(x(oj),uj) is finite, then (E g h)°° = E g h°° and the level sets 

lev h°°{uj) = {ie R n \h°°{x,uj) < 0}, 
levo^/i) 00 ^) = {x G R™ | (E G h) 00 (x,oj) < 0} 

/iai>e i/ie same Q -measurable selectors. 

Proof. By [23 , Exercise 14.54], h°° is a well-defined ^-normal integrand. More- 
over, the lower bound on h implies that h°° is nonnegative. By Lemma [TJ E g h 
and E g h°° are thus well-defined. To show that the latter is the recession func- 
tion of the former, let x € L°(fl, Q, P; R ra ) and A £ Q. Convexity of h implies 
that the difference quotient 

h(x(uj) + Xx(lo),lu) — h(x(uj), ui) 
A 

is increasing in A for every to; see e.g. [201 Theorem 23.1]. The lower bound on 
h and the integrability of h(x(-), •) thus imply that, for A > 1, the quotients are 
minorized by a fixed integrable function. Monotone convergence theorem then 
gives for every A e Q 

E[l A h°°(x)} = E[1 A lim (h(x + Xx) - h(x))/X] 
= lim E[l A (h(x + Xx) -h(x))/X] 

A/'oo 

= lim E[l A ((E G h)(x + Xx) - {E g h)(x))/X} 

A/'oo 

= E[1 A lim {{E g h){x + Xx) - [E e h)(x))/X] 

A/'oo 

= E[l A (E s h)°°(x)}, 

which means that (E g h)°° is the conditional expectation of h°°. 

To prove the last claim, let a; € L°(£l,Q, P;R"). By the first claim and the 
definition of a conditional integrand, 

(E°h°°)(x{.),-)=E°h°°{x(-),-). 
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We have h°°(x(u),u)) < almost surely if and only if E^h°°(x(-), •) < almost 
surely, since h°° > 0. □ 

Remark 1. Consider the parametric problem |P]) and assume that u G L°(tt, J 7 , P; R" 1 ) 
is such that h(-,uj) = /(•, u{uS), uj) is proper. We then have 

h°°(x,uj) = f°°(x,Q,uj), 

where /°°(-, •, uj) is the recession function of /(•, -,uj). It follows that, as soon as 
they are well defined, the recession functions and and thus, the mappings 
N t are independent of the choice of u G dome/?. Indeed, u G dovaip implies that 
there is an x G j\f such that h(x(u)),uj) = f (x(lu) , u(lu) , lu) < oo almost surely. 
Recursive application of Lemmas [H and [5| then shows that h t and h t can be 
expressed in terms of h°° , which is independent of u G dom<£>. 

The following result shows that the linearity condition of Theorem [3] can be 
stated in terms of the original normal integrand h directly. In the proof, we will 
denote the set of (/-measurable selectors of a set- valued mapping C by L°(Q; C). 
We will also use the fact that if C is closed- valued and (/-measurable, then it is 
almost surely linear- valued if and only if the set of its measurable selectors is a 
linear space. This follows easily by considering the Castaing representation of 
C; see e.g. [23l Theorem 14.5]. 

Lemma 6. Assume that h has anintegrable lower bound and that Eh{x{ui) , lj) < 
oo for some x G A/". Then h t is well-defined and N t is linear-valued for t — 
T, . . . , if and only if 

L = {xeAf\ h°°(x(uj), uj)<0 a.s.} 
is a linear space. If x G L is such that x'" 1 = then Xt G Nt almost surely. 

Proof. Redefining h{x,u>) := h[x — x(u)), uj), we may assume that 5 = 0. Indeed, 
such a translation amounts to translating the functions h t and h t accordingly 
and it does not affect the recession functions Kj° and h^°. We proceed by 
induction on T. When T = 0, Lemma [5] gives 

L = {x G Af\ h^(x(uj),uj) < a.s.} = I?{T T ;N T ). 

Since Nt is J^-measurable, the linearity of L is equivalent to Nt being linear- 
valued. Let now T be arbitrary and assume that the claim holds for every 
(T — l)-period model. 

If L is linear then L' = {x G W| xq = 0, h°° (x(uj),uj) < a.s.} is linear as 
well. Applying the induction hypothesis to the (T — l)-period model obtained 
by fixing xq = 0, we get that Nt is linear for t = T, . . . , 1. Applying Lemmas [I] 
and [2] backwards for s = T, . . . , 1, we then see that ho is well defined. Lemmas [5] 
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and [5] give 



L°(T ;N ) = {x G L°(T ) \ h™{x {u),u) < a.s.} 
= {x G LVo) I /^(itoM.w) < a.s.} 
= {x S L°{Tq) I inf /if (xoM,xi,u;) < a.s.} 

= {x G i°(Jb) | 3x G jV : x = %o, hfix 1 ^), u) < a.s.}, 

where the last equality follows by applying the last part of Lemma [2] to the 
normal integrand h°°. Repeating the argument for t = 1, . . . , T, we get 

L°(J"o; A^o) = {x G L°(T ) 3x G A/" : x = a?o, ft§?(jc(w), w) < a.s.} 
= {x G L°(Jb) 3x G A/" : .x = zq, h°°(x(u)), u>) < a.s.} 
= {x G L°(Jb) 3x G L : £ = zo}- (7) 

The linearity of L thus implies that of L (.Fo;iVo) which is equivalent to No 
being linear- valued. 

Assume now that N t is linear- valued for t = T, . . . , and let x G L. Expres- 
sion ([7} for i°(J r ;iVo) is again valid so, by linearity of No, there is an x G L 
with xo = ~xo- Since h°° is sublinear, L is a cone, so that x + x G L. Since 
xo + xo = 0, we also have x + x 6 L . Since, by the induction assumption, 
L is linear and since L C L, we get — x — x G L. Since L is a cone, we get 
— x = x — x — xGL. Thus, L is linear. 

For t = 0, the last claim follows directly from expression (J7J. The general 
case follows by applying this to the (T — t)-period model obtained by fixing 
x*- x =0. □ 

When h is the indicator function of a convex set, the linearity condition in 
Lemma |5] becomes the linearity condition of [1 71 Theorem 8] which generalizes 
various no-arbitrage conditions that have been used in mathematical finance. 
The following example illustrates the situation in the classical perfectly liquid 
market model; see [17j for more general models. 

Example 1 (Superhedging in liquid markets). Let S — (St)f =Q be an R d -valued 
(Ft)J = o- adapted stochastic process, nt = d, m = 1 and 



f(x,u,uj) 

We get 




otherwise. 



if u G C, 



ip(u) = inf I f (x,u) = < 

xeN I -\-oo otherwise, 

where C = {u 6 L° \ 3x G Af : ~Yiit=o x t 1 ASt > w}. /n t/ie classical perfectly 
liquid model of financial markets, where S gives the unit prices of the "risky 
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assets" and x% is the portfolio held over (t,t + 1], the set C consists of the con- 
tingent claims that can be superhedged without a cost; see e.g. [7| Section 6.4]. 
Since f is a closed positively homogeneous function, we have f°° = f and 

T-1 

{x €Af\f°°{x(uj),0,uj) <0 a.s.} = {x G Af\ x t ■ AS t > 0}. 

t=o 

This set is linear, and thus, the function h(x,oS) = f{x,u{uj),uj) satisfies the 
linearity condition in Lemma® if and only if the price process S satisfies the 
no-arbitrage condition. 

The following simple example goes beyond indicator functions and also of 
inf-compact integrands considered in |H] • 

Example 2 (Variance optimal hedging). Let S — (S t )f =0 be an R d -valued 
(J-'t)J =0 - adapted stochastic process, u 6 L°(£l, J 7 , P; R) and consider the problem 
of minimizing 

T-1 

E{V + Y, z f AS t+i - uf 

t=0 

overVo € M. and J- \ -measurable M. d -valued functions z t . This corresponds to (|Pj 
with xq = (zq, Vq), Xt — z t for t = 1, . . . , T and 

T-1 

f{x, u, u) = (Vo + Y, z f A &+i(w) - uf. 

t=0 

The above problem has been studied e.g. in Follmer and Schied ]10\ Section 10.3], 
where V$ is interpreted as an initial value of a self- financing trading strategy 
where z t is the portfolio of risky assets held over period [t, t + 1] . By ]20\ Theo- 
rem 9.4], 

f00 , A _ jo ifV + Efco 1 z t ■ A5 t+ i(w) - u = 0, 
I +00 otherwise. 

By Remark [7J the function h(x,U)) — f(x,u(u>),ui) then satisfies the linearity 
condition of Lemma® so the optimal solution is attained. This should be com- 
pared with the existence results in J 101 Section 10.3], where it was assumed that 
d=l. 

3 Lower semicontinuity of the value function 

We now return to the parametrized problem (|P|). Being the inf- projection of 
the convex integral functional 

If(x,u) = Ef(x(oj),u(oj),u), 
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the value function 

ip{u)= inf Ef(x(u),u(u),u) 

is convex on L (fi,-F, P;R m ); see e.g. [21] Theorem 1]. Our aim is to give 
conditions under which ip is lower semicontinuous on certain locally convex 
topological vector subspaces of L°(fi, F, P; M m ). The lower semicontinuity is 
equivalent to the absence of a duality gap in the duality framework of |17j (which 
is essentially an instance of the conjugate duality framework of Rockafellar [21] ) 
which we now briefly recall (and slightly generalize) . 

Assume that U and y are vector subspaces of i°(fi, F, P; M m ) in separating 
duality under the bilinear form 

(u,y) = E[u(ui) ■ y(u>)], 

i.e. that E[u{lS) ■ y(u))] is finite for every b£M and y G y and that for every 
nonzero u G U (resp. y G y), there is at least one y £ y (resp. u £ U) such 
that (it, y) ^ 0. The special case U = LP and y — L q was studied in [17]. The 
weakest and the strongest locally convex topologies on U compatible with the 
pairing will be denoted by a(U,y) and r(U,y), respectively. Since the value 
function ip is convex, we have by the classical separation argument, that ip is 
lower semicontinuous with respect to cr(U, y) if it is merely lower semicontinuous 
with respect to r(U,y). When U = U> and y = L q for p e [1, oo), r{U,y) is 
simply the norm topology on U and aiU,y) the weak topology. A general 
treatment of topological spaces in separating duality can be found e.g. in Kelley 
and Namioka |14j . 

The Lagrangian associated with fP]) is the extended real- valued function 
L(x,y) = M{I f (x,u) - (u,y)} 

on M x y. The Lagrangian is convex in x and concave in y. The dual objective 
is the extended real- valued function on y defined by 

g(y) = inf L(x,y). 

The basic duality result [U Theorem 7] says, in particular, that g — —<p*. 
When ip is lower semicontinuous and proper, the biconjugate theorem (see e.g. 
[21] Theorem 5]) then gives the dual representation 

ip(u) = sup{(w,y) +g(y)}. (8) 

It was shown in \17\ that this abstract result is behind many duality frameworks 
in stochastic optimization and mathematical finance. 

It was assumed in [17] that U = LP and y — L q , but the main result [17l 
Theorem 3] remains valid as long as the space U is decomposable in the sense 
that 

Xau + Xn\AU G U 
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whenever A £ J 7 , u EU and v! £ L°°(Q, J 7 , P; E m ). Indeed, the decomposability 
property allows the use of the interchange rule for minimization and integration 
(see [331 Theorem 14.60]), which suffices for the proof of [TTJ Theorem 3]. We 
also note that decomposability of the spaces U and y implies that the separa- 
tion property holds automatically for the bilinear form defined above; see [271 
Lemma 6]. Moreover, we have the following relations for relative topologies. 

Lemma 7. If U and y are decomposable, then L°° Cl/CL 1 and 
<r(£\L°°)| W C a(U,y), a(U,y)\ L ~ C a(L°° ,L 1 ), 

t(l\l°")\ u c T(u,y), T{u,y)\ L ~ c t^l 1 ). 

Proof. By [4, Lemme 1, p.??], L°° C U C L 1 and L°° C y C L 1 which give the 
relations for the a-topologies. Since, by symmetry, analogous relations are valid 
for the cr-topologies on y, we have that a{L°° , L 1 )-compact subsets of L°° are 
cr(3 ; ,^/)-compact. Since, by the Mackey-Arens theorem, t(JA, y) is generated by 
the support functions of a(y, W)-compact sets, we get r(i 1 ,i°°)|^ C r(U,y). 
The remaining inclusion is verified similarly. □ 

The traditional "direct method" for proving the lower semicontinuity would 
be to assume that the integral functional If is uniformly inf-compact in x with 
respect to an appropriate topology on N '. If the topology is strong enough 
to imply the almost sure convergence of a subsequence, the sequential lower 
semicontinuity can often be derived from Fatou's lemma. In certain applica- 
tions, this purely topological argument fails because If lacks an appropriate 
inf-compactness property in x. In convex problems, the following measure the- 
oretic result can sometimes be used as a substitute for compactness. 

Lemma 8 (Komlos' theorem). Let (x v )'£L l be a sequence in L°(Ct, J 7 , P; K") 
which is either 

1. bounded in L , 

2. almost surely bounded in the sense that 

sup |x"(<j)| < oo P-a.s. 

V 

Then there is a sequence of convex combinations x v G coja^ fi > v\ that 
converges almost surely to an K" -valued function. 

Proof. See e.g. [7] or [12]. □ 

The following is our main result. 

Theorem 9. Assume that there is ay e J and an m £ L 1 (J7,J r , P) such that 
for P-almost every lo, 

f(x, u,uj)>u- y(ui) + m(uj) V(x, u) £ M™ x M m 
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and that {x G J\f\ f°°(x(oj), 0, uj) < a.s.} is a linear space. Then 

<p(u) = inf If{x,u) 

is lower semicontinuous on U and the infimum is attained for every u G IA. 

Proof. Let h u (x,uj) — f(x,u(uj),uj). The lower bound on / implies that h u has 
an integrable lower bound. As noted in Remark (TJ h^(x,oj) = f co (x,0 7 uj) for 
every u, so the linearity condition on / implies that h u satisfies the linearity 
conditions in Lemma [51 By Theorem [31 the infimum in ip(u) — iid xe j\f If{x, u) 
is thus attained for every u G IA by an x G j\f with Xt{uj) _L N t (uj) almost surely. 

For lower semicontinuity, it suffices to show that ip is lower semicontinuous 
on the linear space L 1,y = {u G L 1 \E[u ■ y]\ < oo} with respect to the norm 
|H|l 1 .!' = E\u\ + \E[u ■ y]\. Indeed, since y G y, we have U C L 1 ^ and, by 
Lemma[71 the norm 1 1 • | is continuous on t(U, y), which means that t(U, y) 
is stronger than the norm topology restricted to U. Since L l ' v is a normed space, 
it suffices to prove sequential lower semicontinuity, which means that for any 
7 G R and for any sequence such that 

ip{u v ) < 7 

and u" — !> u in L 1 ^, we have <p(u) < 7. We will prove this by establishing the 
existence of an x G W such that If(x, u) < 7. 

As observed at the beginning of the proof, there is for every v an x" G N 
such that x v t _L N t and 

//K,^)< 7 . 

Moreover, the mappings Nt are independent of u v ; see Remark [TJ Since u v 
converges in L 1,J/ , the lower bound on / implies that the negative parts of the 
functions lu H> f(x u (uj), v?(uj), uj) are bounded in L 1 . Since If(x u ,u v ) < 7, the 
positive parts must be bounded as well. Thus, by Lemma [SI there is a sequence 
of convex combinations 

00 

0» ^^a^/^fo;), «"(«),«) 

that converges almost surely to a real-valued measurable function. In particular, 
the function <j)(oS) := sup^ <p u {uS) is almost surely finite. Defining 

00 

{x u ,u u ) =^a v > fl {x tl : u 11 ) 

ii—v 

we have by convexity that 

f(x v (u}),u v (u), uj) < 4> u (uj) < 4>(ui) P-a.s. 

and If(x" ', u") < 7. Moreover, we still have x v t G iVj almost surely and u v — >• u 
in the L 1,!/ -norm. 
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Passing to a subsequence if necessary, we may assume that u v u almost 
surely, so that the measurable function p(w) := sup„ ^"(w)! is almost surely 
finite. Each (x v ,u v ) then belongs to the set 

C = {(x,u) eAf X L°\(x,u) eCa.s.}, 

where C(uS) = {(x,u)\x t G N^(w), u e p(o;)B, f(x,u,w) < 4>{u)}. We will 
now apply [T?] Theorem 6], which says that the sequence (a;^, u l/ )£5L 1 is almost 
surely bounded if 

{(x, u) G N x L° | (x, u) G C 00 a.s.} = {(0, 0)}. (9) 

By Corollary 8.3.3 and Theorem 8.7 of [2D], 

C°°H = {(a;,0) | art G A^H, /°°M,w) < 0}. 

If a; £ TV is such that f°°(x(u>), 0, w) < then, by the last part of Lemma[DJ we 
have Xq G A?o- The condition xo G then implies that xo = 0. Repeating the 
argument for t = 1, . . . ,T gives ([DJ so {x v , u v )^ =1 is almost surely bounded. 

By Lemma [5] there is a sequence (aV, u v ) r £L l of convex combinations of 
(x", u u )^L 1 that converges almost surely to a point (x,u), where necessarily 
u = u since u v — >• it almost surely. We still have — > ti in the i 1 ^-norm and, 
by convexity, If{x L ' ,u v ) < 7. By Fatou's lemma, 

- y(u) ■ u(u) - m(u)} 
< liminf E[f{x v {ui), u u {ui), u) — y{u>) ■ u"{uS) — m{uj)] 1 

V — ^00 

where E[y{w) ■ u u (cj)] — > E[y(cj) ■ u(u)}, by the L 1,J, -convergence, so that 
If(x, u) < liminf If(x v , u v ) < 7, 

v— >oo 

which completes the proof. □ 



4 An application to mathematical finance 

We will illustrate Theorem [9] on the optimal consumption problem considered 
in (TTJ Section 5] . The problem is set in a generalization of the market model of 
Kabanov , where a finite number d of securities is traded over finite discrete 
time t = 0, ...,T. At each time t and state lj G O, the market is described 
by two closed convex sets, Ct(u>) and D t (uj), both of which contain the origin. 
The set Ct(ui) consists of the portfolios that are freely available in the market 
and Dt(u>) consists of the portfolios that the investor is allowed to hold over the 
period [t, t + 1). For each t, the sets C* and Dt are assumed to be J^-measurable. 
Consider the problem 

T 

maximize E \^ UAct) 
( z ,c)eA/- ^ tV 11 (10) 

subject to zt — Zt-\ + ot G Ct, zt G Dt P-a.s. t = 0, . . . , T, 
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where Z—\ := 0, Dt(uj) := {0} and — Iff is a convex Ft -measurable normal inte- 
grand on R d x f2. This models an optimal consumption problem where at each 
time t and stage u> we can consume some of the assets and update the existing 
portfolio Zt-x- The combined process (z, c) is required to be self-financing in the 
sense that the sum of the portfolio update Az t := z t — Zt-x and the consumption 
vector ct has to be freely available in the market, i.e. it belongs to Ct{oS). In ad- 
dition, the portfolio constraint Zt(ui) G D t {uj) is required to hold almost surely 
at each time. Problem (fit)]) generalizes the classical optimal consumption prob- 
lem where the numeraire asset is consumed in a perfectly liquid market model 
(see Examples Q] and [2]) . A general treatment of the continuous-time model can 
be found in Karatzas and Zitkovic [13] . 
Defining 

C = {c e A | Az t + c t G C t , z t ED t P-a.s. t = 0, . . . , T}, 

where A denotes the set of Revalued adapted processes (so that Af = A X A), 
we can write problem (fTO)) compactly as 

T 

maximize E ^""^ Ut(ct) over c G C. (11) 
ceA t=o 

The set C can be interpreted as the set of consumption processes that can be 
super- replicated without a cost in the market given by the pair (C, D); compare 
with the definition of the set C in Example [T] 

In order to dualize the problem, we embed it in the general duality framework 
with, x t = (z t ,ct), u = (ut)t= and 



-J2l=o U t{ct,u) if Azt + ct + ut £ C t {u>), z t G D t (tu) 
-oo otherwise. 



Here ut G M. d so that the dimension of u equals m = (T + l)d. The Lagrangian 
integrand becomes 



l{ x ,y^)^ inf {f(x,u,u) - u ■ y} 

T 

= inf {- Y][U t {ct, u) + ut- y t ] \Az t + ct+u t e C t (w), z t G D t (u)} 

t=0 
T 

= inf {-y][Ut(ct,u) + (ut-Azt-ct)-yt]\ikeC t (u)), z t G D t (u)} 

t=0 

_ I -Y?t=o\Ut{ct, w) + c Ct(w )(y t ) - (Az t + ct) ■ y t ] if 2t G AM 
1 +oo otherwise 

= \-Tj= [Ut{ct,u) + (T Ct (u)(yt) + z t ■ Ay t+l - c t ■ y t ] if z t G D t (u) 
] +oo otherwise, 
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where crc t (u) denotes the support function of Ct(to). In the last equality we have 
used the "integration by parts" formula 

T T 

^2 Az t • y t = - ^2 z t ■ &Vt+i 
t=o t=o 

where yr+i := 0. 

Recall that, by Lemma y C L 1 . On the other hand, by [T71 Theorem 30, 
g(y)= inf El(x(u),y(w),u>), 

where Af°° — Af n L°° . We can then use the law of iterated expectations (see 
e.g. [241 Section II. 7]) and the interchange rule for integration and minimization 
(see e.g. [221 Theorem 14.60]) to write the dual objective as 

g{y) = , mf \Ej2l-u t (c t )~ac t (yt) 

- z t ■ E t Ay t+ i + c t ■ E t y t ] \ z t G D t a.s. ^ 



E Y1 inf „, {-^(Ct!^) - ^C?«(u)(mH) 

- z t • (E t Ay t+1 )(oj) + c t ■ (E t y t )(uj)\ z t G A(«)} 

T 



t=0 



where 



U?(y,u)= inf {c • y - U t (c,u)} 



is the conjugate of Ut in the concave sense. 

When Ct(w) and -Dt(w) are convex cones, we have 



'EYZ= U t *(E t y t ) if ye 2?, 
— oo otherwise, 



s(y) 

where 

£ = {y e^|y t e C?, B 4 Ay t g D* t }, 

where C t *(w) and D%(u>) are the polar cones of Ct{u:) and Dt(ui), respectively. 
The dual problem can then be written as 

T 

maximize E^~^ U^(y t ) over y G T> (12) 
t=o 



2 Theorem 3 of 1171 is stated for the case U = L p and y = L q , but its proof goes through 
in the general case without a change. 
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in symmetry with the primal problem (fTT|) . The (J r t )^ -adapted elements of 
T> are called consistent price systems for the market model (C, D); see [17l 
Example 4.2]. It follows from Jensen's inequality that the value of the dual 
objective does not decrease when replacing a general y G y by its {J- t )J =0 - 
adapted projection; see [TTJ Example 3.6]. 

In [17] . the lower semicontinuity of the value function associated with the 
optimal consumption problem was left open so it could not be claimed that the 
optimal values of (fTTj) and (IT21 are equal. With the help of Theorem [§1 we can 
now derive simple sufficient conditions. We will assume that the utility function 
Ut satisfies the growth condition 

V?(^) = \° (13, 
I — oo otherwise, 

for every u>. When d = 1 and Ut{-,uj) is smooth, this is equivalent to the 
conditions lim c _j. oc U' t (c,uS) — and lim c _ ! ._ 00 U{.(c,lu) = +oo which generalize 
the Inada conditions; see e.g. [15]. We will say that the market model (C,D) 
satisfies the condition of no-scalable arbitrage if 

{c e A+ 1 3z e A : Az t + c t e C t °°, ** e A 00 } = {0}, (14) 

where ,4+ denotes the set of componentwise nonnegative claim processes. This 
condition is related to arbitrage opportunities that can be scaled up by arbi- 
trarily large positive numbers; see [161 118) . 

Theorem 10. Assume that the optimal value of (|10[) or, equivalently, pip 
is less than oo, that Ut satisfy the growth condition (| 13[) and that there is an 
integrable function m such that Ut(c,ui) < m(ui) almost surely for every c £ R d 
and t = 0, . . . , T. If the market model (C, D) satisfies the no-scalable arbitrage 
condition (|14[) and if the set 

{zeA\Az t eC™, z t eD™}, 

is linear, then the optimal value of 

T 

maximize E Utjct) over c£C—u 
t=o 

is lower semicontinuous as a function of u £ U and the infimum is always 
attained. In particular, if C and D are conical, then the optimal value of the 
primal (jlip is the negative of the optimal value of the dual (|12p . 

Proof. By [2DJ Theorems 9.3 and 9.5], 

( 



f°°(x,u,u) = 



Et=o Ufict, u) if Az t +c t e q»(w), z* 6 A°°M 
oo otherwise, 
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so, by Lemma [5J the linearity condition of Theorem |H] means that the set 

T 

l = {{z,c) eAf\^2 ur(ct) > 0, Az t + ct e C t °°, * g A 00 } 
t-o 

is linear. Under the growth condition (|13|) . 

L = {(z, c) e | c t > 0, Az t + a e C t °°> e A 00 } 

If (z, c) 6 L, condition (|T4")) implies that c = 0, and then, by linearity of the set 
{z £ A Az t £ C t °°, Zt 6 we have (— z, — c) £ L. Since L is also a cone, it 

has to be a linear space. □ 

Remark 2. TTie conclusions of Theorem \10\ remain valid if, instead of the 
growth condition (Till]) and the no-scalable arbitrage condition (fill) . we assume 
that the set 

T 

{ceA\3zeA: ^ U?{c t ) > 0, Az t + c t e C t °°, z* e Df}, (15) 

t=0 

is linear. Indeed, if L is as in the above proof and {z,c) £ L, condition (| 15[) 
gives t/ie existence of a z~ E A such that c) £ i. Since L is a cone, 

we get (z + z~,0) £ L and then, the linearity condition of Theorem \10\ gives 
(—(z + z~),0) £ L. Since — (z,c) = (—(z + z~),0) + (z~,— c), we get that 
— (z, c) £ L, i.e., L is linear. 
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